Interactive Multi-Label Segmentation (web-version)
نویسنده
چکیده
Interactive image segmentation deals with partitioning an image into multiple pairwisedisjoint regions based on input provided by a human operator. Being interactive means, that an algorithm has to quickly react on user input, which limits the computational complexity of the employed algorithms drastically. Therefore, many interactive segmentation methods represent these regions with simple models based on low-dimensional feature spaces, which in turn introduces a limitation in terms of expressibility of these models and thus segmentation quality. Furthermore, most methods can only handle the two-label case, i.e. the segmentation of an image into foreground and background. In this work, we investigate the incorporation of arbitrary high-dimensional features in an interactive multi-label segmentation framework. With such high-dimensional features, not only color and grayvalue information, but also complex textural properties of a region can be modeled. In order to not violate the runtime constraints, we carefully select the building blocks of our framework according to their ability of being implemented on parallel architectures: We employ Haralick texture features and Local Binary Patterns to represent local image structure, as well as Random Forests as learning algorithm. Finally, we employ a multi-label Potts regularizer in order to obtain spatially compact image segments. All these parts are implemented on the GPU or multi-core CPUs in order to achieve runtimes that allow for convenient user interaction. We furthermore address the problem of comparatively evaluating interactive multilabel segmentation algorithms and introduce a large novel benchmark dataset. With this dataset, we perform detailed experiments in order to evaluate the performance and runtime of the building blocks of our framework. We show the benefit of incorporating texture and color features over employing color features alone. Finally, we compare our framework to the state-of-the-art Power Watersheds method and highlight advantages and drawbacks of both approaches.
منابع مشابه
Interactive Multicut Video Segmentation
Video segmentation requires separating foreground from background, but the general problem extends to more complicated scene segmentations of different objects and their multiple parts. We develop a new approach to interactive multi-label video segmentation where many objects are segmented simultaneously with consistent spatio-temporal boundaries, based on intuitive multi-colored brush scribble...
متن کاملInteractive Multi-label Segmentation
This paper addresses the problem of interactive multi-label segmentation. We propose a powerful new framework using several color models and texture descriptors, Random Forest likelihood estimation as well as a multi-label Potts-model segmentation. We perform most of the calculations on the GPU and reach runtimes of less than two seconds, allowing for convenient user interaction. Due to the lac...
متن کاملInteractive Multi-label Segmentation of RGB-D Images
We propose a novel interactive multi-label RGB-D image segmentation method by extending spatially varying color distributions [14] to additionally utilize depth information in two different ways. On the one hand, we consider the depth image as an additional data channel. On the other hand, we extend the idea of spatially varying color distributions in a plane to volumetrically varying color dis...
متن کاملConvex Optimization for Image Segmentation (print-version)
Segmentation is one of the fundamental low level problems in computer vision. Extracting objects from an image gives rise to further high level processing as well as image composing. A segment not always has to correspond to a real world object, but can fulfill any coherency criterion (e.g. similar motion). Segmentation is a highly ambiguous task, and usually requires some prior knowledge. This...
متن کاملRobust Interactive Multi-label Segmentation with an Advanced Edge Detector
Recent advances on convex relaxation methods allow for a flexible formulation of many interactive multi-label segmentation methods. The building blocks are a likelihood specified for each pixel and each label, and a penalty for the boundary length of each segment. While many sophisticated likelihood estimations based on various statistical measures have been investigated, the boundary length is...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010